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Included are an examination of the formal structure 
of information collection and recording activities, and a discussion 
of the analysis of the resulting data. Although specific application 
of these ideas is applied to a secondary school system, the 
applicability of the formal structures and analytic methods is 
thought to be general. In section I, traditional structures are 
formalized: the school system’s organizational structure and the 
functions, inputs, and outputs of the typical department. In section 
IT, methods of analysis ^or functions and data are explored. In the 
final section, general procedures are suggested for the collection 
and recording of requirements and for analysis. Possible utilization 
of the computer in manipulating the large volumes of data is 
explored. The appendix contains a brief review of the literature on 
collection and analysis methods. (PP) 
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ABSTRACT 



This report provides an extended view of the formal information 
struclures required in the requirements collection and recording phase 
of development. Analytic methods are suggested for use upon the result- 
ant data as guidance to implementation. The ideas in the paper are 
applied to a secondary school system by way of example but the appli- 
cability of the formal structures and analytic methods derived is 
thought to be general . 



INTRODUCTION 



As the scope of information systems development has broadened to 
encompass the "total" systems concept, the mere extension of the systems- 
and-procedures methods of identification of input, processes and output 
has not been completely adequate to the task of systems design. Tradi- 
tional methods have not contained information structures that reflect the 
nature of the total system, which may become a quite complex network of 
subsystems. In addition, traditional methods for single-function systems 
have not had to deal with the complex problem of deciding what to do 
first; that is to say, deciding which functional subsystem to implement 
first, or what systems-level data groups to consider first for machine 
support. Truly analytic methods directed to this problem have been 
totally lacking, with dependence placed upon the intuitive advice of the 
computer specialist or upon political exigencies. 

In view of this situation, the intent of the present report is to 
examine the formal structure of large information-requirements collection 
and recording activities, and to discuss the analysis of the resulting 
data. The specific application of these ideas to a secondary school sys- 
tem is occasioned by the availability of a recent requirements report for 
which the writer was partially responsible (6). Although that report has 
been used to provide the examples cited in this paper, it is the writer's 
view that formal structures and analytic methods are applicable to many 
types of organizations and information systems. 

A developmental approach has been taken in the paper in exploring 
structures and methods. In Section I, traditional structures are for- 
malized: first the organizational structure, and then the functions, 
inputs, and outputs of the typical department. Additional system-level 
structures and secondary structures needed for analysis are then defined. 

In Section II, methods of analysis for functions and data are explored. 
Section III, general procedures are suggested for the collection and 
recording of requirements and for analysis. Possible utilization of the 
computer in manipulating the large volumes of data is explored. Through- 
out the paper, the ideas are illustrated using a small segment of require- 
ments data extracted from the referenced report. Finally, the appendix 
contains a brief look at the literature on collection and analysis methods. 
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SECTION I: STRUCTURE 



1 1 The Functional/Organizational Structure 

In our model, the school system will be viewed as a community of 
user-processors of information, with a primary structure based on existing 
organization. This is essentially a functional^ structure with minor 
deviations . 

We will denote the system S and departments D, so that we may write: 



S = {D.} 



( 1 ) 



Since each department may also be a set of subdepartments, we have: 




( 2 ) 



We may then rewrite (1) above as 

s = {uh 

1 



(3) 



with {i> defining a partition of S according to lines of authority. 

The D-structure presented in Table I is based upon the organization 
of the Atlanta Public Schools as of Summer 1968. The structure is intended 
only as a representation, and no analysis or criticism will be made of it. 



Table I 

User/Processor Notation 

Dq Superintendent of Schools (Immediate Staff) 

Dj Controller 

^2 Administrative Service 

D 2 Instruction 

School Plant Planning and Construction 
Dg Staff Personnel 

Research and Development 



1. To avoid confusion, the mathematical concept of function will be 
always referred to as a mapping. Function here refers to an organizational 
work task and the term "process" will be reserved for sequences of functions. 
This usage is common. 




3 



“ll 


Area 


I Schools 


°12 


Area 


II Schools 


“l3 


Area 


III Schools 


“l4 


Area 


IV Schools 


°15 


Area 


V Schools 




X 



Controller (Immediate Staff) 

General Fund Accounting 
Cafeteria Accounting 
School Accounting 
Special Project Accounting 
Payroll Accounting 
Bond Accounting 

Budget/Savings Bonds/Summer School Tuition 
Information Processing 




Administrative Services (Immediate Staff) 
Food Services 

Maintenance and Operations 
Purcliasing 

Administrative Services (Misc.) 

Detectives 



Table I, cent. 



D 



0 








Instruction Clmmediate Staff) 

Curriculum Development and Supervision 
Athletics, Physical Education and Military 
Instructional Resources 
Pupil Services 

Vocational Education and Technical Education 

Project Organization 

dit 



0 

D4 School Plant Planning and Construction (Immediate Staff) 

d \ Plant Planning 

D4 Land 

3 

D4 Construction 

4 

D4 Equipment 



D5 Personnel (Immediate Staff) 

D5 Recruitment and Placement 

Certificated Personnel Support Services 
3 

D5 Non-cert if icated Personnel Support Services 



D6 

1)6 






Research and Development (Immediate Staff) 

Project Organization 

dit 




5 



Area I Office (Immediate Staff) 
Elementary School 



1^11 



D 



11 



Dll 



High School 



Each department may be considered as a subsystem with its own 
information requirements, functions and information products. In our 
already established set-theoretic notation we can refer to a department 
as a triple: 



D. = (S. , I. , 0.) 

1 11 * 

where S^ is the set of functions performed by department D. ; that is: 

the set of all functions of the school system, 
is the set of input data groups (or data sets) required by in 
the accomplishment of S, i.e.: 

I. = ah 

1 1 

2 

and similarly 0^ is the set of information outputs of department D. : 

0. = {o^} 

1 1 

Note that, in the case of functions, the union of all S. in the 

1 

system generates the set P, i.e.: 



'fhis is not so straightforward for inputs and outputs of D^. The sets of 
interest here are F(the set of unique data groups within the system) and 
R(the set of informational products of the system). The difficulty is 
highlighted by noting that an output of one may be the input to 
another. We can however note that: 



^ {I^} = F and {0^}^ (RIJF) 

where the elements of I^ incorporate data groups which are either trans- 
actional data or file records, and where the elements of 0^ may be either 

2. Other departmental products or outputs, eitner physical or 
intangible, are not shown in this presentation. 
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terminal outputs (formal reports) or data sets of F. 

Referring again to the requirements study (6) , we can extract as 
examples the (S, 1,0)— structures for two subdepartments. These are pre- 
sented in Table 11. 

Table 11. a. 

Structure of Payroll Accounting Department 
1. The S-Structure: Functions 



(Pl)l 


Payroll records maintenance 


(Pl)2 


Professional payroll preparation 


(pbj 


Non-professional payroll preparation 


(Pi >4 


Student payroll preparation 


(pbs 


Payroll funds accounting 




Tax and benefits reporting (periodic) 


The 1 -Structure: 


Input data groups 


(ibl 


New employee background data 


(ii)2 


Employee action transmittals 


(if)3 


Time/attendance -- personnel 


(il)4 


Time -- student 


(ibs 


Employee employment history 




Employee payroll history 


(iby 


Student payroll record 


The 0-Structure: 


Output data groups 



(of)j 

Cob 2 



Payroll checks 
Payroll register 



Table I I. a. cont. 



The 0 structure, cont. 





(01)3 


Deductions register 




(01)4 


Year-to-date register (salary and attendance) 




(ohs 


Automatic voucher register 




(01)^ 


Hospitalization and medical register 




(01)7 


Records corrections audit lists 




(°i )8 


Employee employment history 




COl)g 


Employee payroll history 






Tax forms 






Tax reports 




(0^12 


Employee savings bonds 






Table 11. b. 




2 

Structure of D 5 : Certificated Personnel Support Services 


The 


S-Structure: 


Functions 




(P 5 )i 


Career planning assistance 




(P|)2 


Payroll certification 




(pi) 3 


Professional certification, grants, and 
prior service 




(PS)4 


State reimbursement claims processing 




(P5)s 


Fringe benefits assessment 




(P|)6 


Records and Statistics handling 


The 


1 -Structure: 


Input data groups 




(i5)i 


Teacher career development actions 




(is) 2 


Placement notices 




(is') 3 


Employee background record (new) 



Table 11 .b. cont. 



The 1 structure, cont. 



(is) 4 


Personnel pos. /pay /re cord changes 


(is) 5 


Special earnings reports 


(is)6 


Employee employment history record 


(is) 7 


Employee payroll and position record 


(is)8 


Grant applications 


(15)9 


Grant allocation notices 


(ii^io 


Payroll step status 


(is) 11 


State fund allocation 


(is) 12 


Benefits schedules and descriptions 


(is^lS 


Personnel actions/communications 



3. The 0-structure: Output data groups 

Employee payroll and position record (new) 
Employee action transmittals 
Employee notifications 

Personnel classification and analysis report 
Personnel salary report 
Personnel certification report 
State teacher requisition report 
Personnel position lists 
Automatic salary increment report 



(05)1 

(oi)3 

(oi)s 

(05)7 

(oi)g 

(0^9 



1.2 Analysis of Structure: 



The structure which has been defined so far is the input-process- 

output inventory normally included in a systems analysis: all that is 

missing is the process flou chart linking the three and the system- 

level process-chart linking processes. 

Uliile it is not the primary concern of this paper, being well 

handled elsewhere (7,14), the intradepartmental structure i.e., the 

internal structure of d|) can be formalized by the use cz mathematical 

relations and deiicted in terms of matrices or graphs. For example, a 

2 

portion of department could be viewed as a directed graph; 




and relations of input to processes reflected by an inciaence matrix 
(where a "1" indicates that ij^ is an input to p^ and a "O’* indicates 
there is no connection) ; 





il 


12 


I 3 




15 


ie 


i? 


is 


io 


ilC 


ill 


il 2 


113 


Pi 


1 


0 


0 


0 


0 


1 


0 


0 


0 


0 


0 


0 


0 


P2 


0 


1 


1 


1 


1 


1 

X 


1 


0 


0 


0 


0 


J 


0 


P3 


0 


0 


1 


0 


0 


1 


0 


1 


1 


1 


0 


0 


0 


P4 


0 


0 


0 


0 


0 


0 


1 


0 


0 


0 


1 


0 


0 


P5 


0 


0 


0 


0 


0 


0 


1 


0 


0 


0 


0 


1 


0 


P6 


1 


1 


1 


0 


0 


1 


0 


1 


0 


0 


0 


0 


1 1 



Process -to-output relations can be shown by a similar matrix. A corre- 
sponding set-theoretic notation would take the form of ordered triples: 

2 

1 ^^ ~ ^ 1 * ^ ^ 2 * ^ 1 ^* ^^ 2 * ^ 2 * ^ 2 ^* ^ ’ 

an enumerative process. These examples are presented merely to show 
some of the formalisms now used to analyze system structure. To pursue 
this vein further, see one or more of the references noted above. 

Utilizing one or more of these notational techniques with matrix 
algebra, v/e are able to build substantial models of the department, of 
the functional sector of the school system, and of the complete system. 
There are techniques to reduce complexity, and meaningful analysis of 
the system can be performed using these representations. 

The structural preliminaries have the effect of forcing an 
exhaustive enumeration of the classes S, I, and 0 for each department. 

The immediate difficulties in simply generating department strictures 
indicate the incompleteness of data in the standard requirements report. 

Two factors must be considered; 

1. Too great a depth of detail must be avoided, else 
the manpower expenditure in collecting data is 
prohibitive; and 

2. Change becomes a detiorating factor as the depth 
of structuring is increased. 

Practical considerations work against a hard structuring of the 
type suggested. For one thing, it is difficult to get identical functions, 
inputs or reports so identified and named. If they are separated phys- 
ically, there is an inertia based on past usage and an intradepartmental 
bias to maintain separate names; this works against any effort to mech- 
anize even the purely structural aspects of analysis. The attempt in 
the subject study (6) to enumerate functional overlaps is something 
that might be expected of a structural analysis, but on the basis of 
the data as available and presented, it would be impossible to make a 
serious effort at such analysis. The inclusion of such data in the 
report was evidence of mental assimilation on the part of the project 
staff, rather than of a formal process. 
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Intra- and interdepartmeiiLal structures, however, are not tne pri- 
mary interest of this paper. We want to look at the sets P, F, and R, 
and to concern ourselves with structures which may be imposed on these 
sets during the requirements definition. TJirough the use of such 
structures, we hope to be able to make concrete recommendations about 
subsequent information system development. 

1 . 3 Structures of P, F, and R 

While the organizational (i.e., departmental) structure of the 
school system provides the most apparent and predominant ordering of 
the sets P, F and R, it is also the strongest deterrent to our ability 
to view the complete system. On each of the sets, therefore, we want 
to explore other ordering relations which might provide insight into 
the desired behavior of the school system as a whole for guidance in 
organizing processes and data for an automated environment. In this 
section we will postulate a variety of secondary structures, keeping 
in mind the need to further analyze and organize P, F, and R. We will 
assume that the definition of the departmental structure is complete, 
giving us an enumeration of the elements of P, F, and R as defined in 
Section 1.1. 

1.3.1 Structures on the Set of Functions P 

We have already noted the most prominent structure on the set of 
functions — the organizational structure of Section 1.1. This wo can 
note formally as a mapping: 

Rpo^ P D where the mapping relates a specific function 
to an organization. For example. 

Of course this is a posteriori according to cur previous definition; the 
new notation provides no further enlightenment as to the composition of 

P. 

We would like to extract one or more additional structures of the 
set of functions from the data supplied in the requirements. Three of 



these are identifiable and will be called program importance, organi- 
zational importance, and precedence. 

1) Program importance , Rpg, is a mapping on the 
set of functions which associates each function 
with an identified program within the school 
system. Formally: 

Rp(;; P G 

where G is the set of formal goal-oriented 
programs of the school system. The set G 
will be derived in Section 1.4. Because each 
of the members of G assumes a relative priority 
within the set, this priority carries over to, 
or is induced on, the elements of P which are 
mapped into G . 

2) Organizational importance , RpM, is a mapping of 
the set of functions which associates each 
function with one of the traditional activities 
within which the organization operates. Formally; 

Rpj^: P M 

where M is a set of activities based on traditional 
line-staff relationships. We will derive the 
elements of M in Section 1.4 and indicate an 
order on the set which, again, we will want to 
induce on P . 

3) Precedence , Rp^-, is a chronological ordering of 
functions based o.i inherent cycles of operations 
within the organization. It defines a set of 
processes into which each function can be mapped. 
Formally; 



^PW* P w 

Within W we have many processes, and each process 
wj^ is strictly ordered internally. This internal 
ordering of processes may be used to provide guidance 
as to the order in which the elements of P are to 
be developed. The set W is partially connected and 
does not induce a partition as do the other two 
structures; however, we will see that W still prove 
useful in analysis. 



1‘3.2 Structures on the Set of Data F 



As was the case with the set of functions, the most obvious struc- 
turing of data occurs as a result of the dominant departmental structure. 
We could also consider a structuring based upon program or activity, 
but such mappings are only indirectly useful to an analysis of the set F. 
Looking foi additional structures on F we find the intuitive structure 
based on the object for which the data set is an attribute. In addition, 
we find that the sets D and G each generate a measure which provides 
an important descriptive structure on F. 

Object attribute , Rpc, is a mapping of the data 
sets which corresponds to the intuitive struc- 
turing of data according to the real-world object 
it is describing. Formally: 

FL 

where C is the set of object classes. The 
elements of C which are pertinent in the school 
system are defined in Section 1.4. The set C 
is the primary partition of data for machine 
applications . 

2) Department use , Rpu* is a mapping of the data 

sets which is generated by the department structure. 

It maps each data set onto a measure of use or demand 
U. Formally; 



where U is a discrete measure in the positive 
integers. It might be termed a demand index, since 
it reflects the number of departments utilizing a 
specific data set. It will be briefly discussed in 
Section 1.4. 

Program use . Rpy, is a mapping similar to 
departmental use in that the set of programs 
generates a mapping of the data sets onto a 
value structure or measure V. Formally: 

Rpy: F V 
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where V is a measure over the positive real 
numbers. Again it is an index, associating 
with each data set a number reflecting the com- 
posite importance of the programs by which it 
is used. Its construction will be briefly dis- 
cussed in Section 1.4. 

Later in this paper we will present methods of generating these 
structures on F, and we will attempt to consolidate the three; our intent 
is to postulate an organization on F and a priority for implementation. 

i.S.3 Structures on the Set of Information Products R 



The set R provides us with information on the organization; for 
instance, a partial structuring of R tells us which departments are 
information generators and which are only users. Thus, we have to look 
elsewhere for internal order and structure for R. Three structures can 
be postulated, two (urgency and primacy) based on use, and one (source 
originality) based on function. 

1) Urgency , Rrt. Probably the most readily obtained 
attribute of a product is the frequency with which 
it is produced. While this tends to be predicated 
in terms of existing capabilities rather than of 
actual urgency , it does give us a measure of 
immediacy in terms of use. This attribute then 
can be described as: 

RrT : R T 

where T is a set of time intervals of increasing 
size. A complete enumeration will appear in Section 
1.4. 

2) Primacy (or importance) . Rp^. Eacn informational 
product is related to one dr more functional 
responsibilities of the system as reflected by 
the activity structure M. Primacy, then, is a 
mapping of the products onto an importance measure. 
Formally: 

Rrx: R X 

where X is a measure defined on the positive real 
numbers. The method of generation for X will be 
discussed in Section 1.4. 

- 15 - 
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3) Source originality , R]^. This is essentially a 
dichotomous partitioning of information products 
into those which are generated and increase or 
change F and those which are merely a regen- 
eration and restatement based on F. The gen- 
erated members of R usually result from data 
collections as a part of the function, and the 
regenerated members are usually thought of as 
terminal products. The mapping is formally: 

"rn= 

where N is a two-element set: source data and 

processed data. It is occasionally convenient 
tc consider some further division of processed 
data to better describe the output report, e.g.: 
analysis, summary tabulation, detail tabulation, 
projection, etc. However, our main reason for 
declaring the set N is to reflect that retained 
products do exist and are within our definition 
of data. We will make little or no use of termi- 
nal products . 



1-4 Summary of Structures With Examples 

The previous section identified several subordinate structures (or 
at least sublimated structures) which tend not to be brought forward in 
the analysis of requirements. It might be well now to summarize the con- 
tent of that section and to present examples of structures with which to 
work in later sections. Such a summary is found in Table III. 
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Table III 



Mapping 


Domain 


Range 


Rpp : organization 


P 


D : departments 


Rp^ : program importance 


P 


G : programs 


Rpj^j : org. importance 


P 


M : states of activity 


R_„ : precedence 


P 


W : processes 


Rp^ : object attribute 


F 


C : object classes 


R : program use 

r V 


F 


V : value index 


R ,, : department use 

rU 


F 


U : demand index 


Rp^ : urgency 


R 


T : time intervals 


Rp^ : primacy 


R 


X : importance index 


Rpj^. : source/orig. 


R 


N : source or proc. 

1 



It is critical to requirements analysis that both set G and set M 
be formally declared and that the priorities within each be agreed upon. 
Table IV will enumerate the programs and activities and establish ranking 
within the list. It should be noted that there can be no real incompat- 
ibility between the two sets, although some inconsistencies may show up 

3 

when related to the department organization. We will make multiple use 
of both G and M in structuring our system sets P, F and R. 



3. These were generated from discussions with Mr. Tom McConnell of 
the Atlanta Public Schools, but are intuitive and informal in nature, 
based on personal judgments. An attempt at formal definition is under 
way but in no way affects the general use as here proposed. 
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Table IV 





G 


M 


No. 


Element Name 


Prior. 


No. 


Element Name 


Prior. 


8l 


management 


1 


"■i 


oper'l instruct. 


1 


82 


oper. services 




m^ 


opcr'l services 


2 


83 


pupil services 


1 

J 


m^ 


oper’l admin. 


3 


84 


normal instruct. 


4 




mgmt. control 


4 


85 


athletics 


I- 

0 




mgmt. admin. 


5 


8& 


spec, instruct. 


6 


\ 


mgmt . planning 


6 


87 


program planning 


7 




staff services 


7 


88 


research/devel . 


S 


mg 


staff instruct. 


8 








mg 


staff admin. 


9 



The object classes C are tentatively drawn within the require- 
ments document without any real recognition of the process by which they 
were selected. That this is a very natural classification of data is 
admitted, but here we wish to call attention to it emphatically. With no 
regard to internal precedences, the set of classes of objects within the 
school system are shown in Table V. 



Table V 



The Elements of C 

: pupil data 

. personnel data 

^3 ■ financial data (incl. funding, budgeting, acctg.) 

C 4 : facilities data (incl. installed equip., land) 

c^ : equipment data 

: instructional resources 

4 

c_ : system structure data 



The set of processes W is best defined in certain specialized 
applications, such as process control; in general data processing, it 
tends to be well defined only in fragments, and then only after key 
decisions are made. One advantage of mechanical processing of function 
statements is the sorting out of processes. For our purposes here, we 
will indicate precedences among the elements of P as they exist in 
our examples in Section 1.1. Table VI enumerates the set W to the 
extent possible from our examples. 

In the structures we have chosen as our examples, we find sequences 
occurring only in w^, W 2 , and w^. If we were to add a school, or 

the Personnel Recruiting Department, for example, we would see more 
complete sequences, and the enumeration would be more complex. 



4. The latter set constitutes an external system data class which 
must be available and includes organization data, staffing structure, 
etc. 



Table VI 



The Elements of W 






w. 



w. 



w, 



Wr 



w. 



w 



8 



w 



10 



• • • 2 ^ 2 ^ • • • 

• • • (p 2^2^ 1^2^ 1^5^*** 

•••(Pl)l, (Pi)3. (Pi)5,... 

•••(Pl)4. (P])5.--- 

• • • 1 » • • • 

r 2. 

• • • ^P5^4» • • • 

• * • ^Ps^5* * ‘ ‘ 

7 



The sets V, U, and X are defined without difficulty. The set U is com- 
posed of positive integers, where u. is the number of departments using 
a data set; similarly, the set V is a set of positive reals reflecting 
again multiplicity of use (this time by programs) , but also incorporating 
the priorities. X is a set of positive rea’s reflecting multiplicity of 
use for products, again making use of the priority structure of M. Since 
the computations are straightforward, we will postpone examples until they 
are needed. 

The time intervals T are readily derivable from the requirements 
document, since they are among the concrete elements of reports de- 
scriptions. In Table VII we generate those time intervals found in 
Section 4.8 of the subject study (6). Finally, in Table VIII we list the 
set N for completeness. 
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Table VII 



The Elements of T 





daily 






weekly 




^3 * 


month ly 






20 days 




^5 ■ 


quarterly 






annual ly 






process cycle 




^8 • 


end of project (length 


of project) 


^9 • 


on demand/ as required: 


immediate 


^10 • 


on demand/ as required: 


elapsed time response 




Table 


VIII 




The Elements of N 


"i ■ 


source data 




: 


processed data 
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SECTION II: ANALYSIS OF REQUIREMENTS 



2.1 Introduction 

In the first section we presented a reasonably complete catalog of 
structures within the school system. Out of the primary system components 
(functions, P; data, F; and products, R) we obtain the raw resources for 
our information system. The secondary structures (see Table III) represent 
a variety of influences on the primary components; and, if viewed in some 
coordinated fashion, can provide us with considerable insight into design 
approach. A simple diagram of the relationships among structures may be 
helpful before proceeding; see Figure I. 

As indicated in Figure I, each element in P, F, and R has associated 
with it attributes induced by the mappings shown. This means that each 
element of P, for instance, has acquired as an attribute an element each 
from D, M, G, and W. Each of these attributes makes its demands on p^, 
imposing on P a complex structure. If we understand this structure then 
we will be better able to make the proper decisions concerning implemen- 
tation phasing. 

In this section we will investigate systematic techniques for deriving 
the induced structures of P and F, and we will inquire whether or not 
guidance can be obtained which is at least consistent with current intui- 
tive practice. 
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Figure I 




(Activities) 



(Programs) 





23 



2.2 Analysis of the Set P 



Taking the D-structures defined in Section 1.1 for the Payroll Ac- 
counting Department and the Certificated Personnel Support Services Depart- 
ment, our first requirement is to generate the mappings of P onto M, G, 
and W. This is shown in Table IX. 



Table IX 



Function 

Cp^) 


Program 

(gi) 


Activity 

(m^) 


Process 

(w^) 


(pi)i 


g2 


my 


Wl ,W2,W3,Wy 


CPl)2 


g2 


my 


W2 


(Pi)s 


g2 


my 


W 3 


(Pb4 


g2 


my 


W4 


CPl)5 


g2 


m^ 


W 2 ,W 3 ,W 4 


(pO, 


g2 


my 


W 5 


(P5)i 


g2 


my 


We 


(P5)2 


g2 


m 4 


Wl 


2 

(P5)s 


g2 


m0 


Wy 


2 

(P5)4 


g2 


mg 


W8 


(P5)s 


g7 


m0 


Wg 


2 

CP5)e 


g2 


my 


wio 



The partitioning of P according to D is reflected in the indexing of 
the p^ elements directly. By inspection we see that both G and M 
generate (induce) partitions on P. 



In our example' 
P/G 

and P/M 



Ug2^ [gyH 

{^CPl)l,...,(Pl)6.(Ppl 



n 



(P5)4.lF5l£, 



, 2 . 

. ^ ' 



J 9 



[m^J, [m^J, [m^j } 

^ 1 > • • • > > CP5) 6^ » 

Kp^sh’ (Pp3> fP5J4^' ^Ps^s’- 

In the case of W, however, the mapping is ambiguous in some cases, failing 
to produce clean partitions. The w^, however are well defined, and we can 
enumerate their contents in order; this was actually done during the 
generation of Table VI, Section 1.4. The w^'s there are partially Getj.ned. 
ordered sequences. The structure generates both precedences and connec- 
tivity for the p^ involved. Thus; 

2 5 5 5 5" 

P/K ; i{...(P5)2»(Pi)i»-->»^--CPi)i,(Pi)2’*^Pp5- 

■f- • 1^3 1^5** ' ^P 1 ^ 4 * ^P 1^5*' *■* * 



5 2 ' 2 5 

{ • • Cp Cp^ ^^ 5 ^ 3 ^ 



2 ' 2 2 

^ • (p ^ 5 ^ 5 ^**^^^** ^P 5^6^ 



}} 



We need to establish a precedence order between G and M and will 
therefore state that G has priority (precedence j over I", or; 

G > P 

This is reasonable, since G most closely approximates a goals -structure 
on the system. 

Our initial structure of P corresponds to P/r , aoeve. We now 
want to modify this structure using P/M. This we car do by considering 
a graph-theoretic representation of the partitions relative to both G 
and M (see Figure II). 



5. We “’ssume for convenience the notation cf algebraic quotient 
structures to reflect the partitioning of our primary sets P, F, and R. 
Hence: 

P/G denotes the partitioning of P induced by G and [giJc P/t 
indicates a partition or subset of P of all functions having like program 
mapping . 
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Figure II 




Here the graph indicates a partitioning of [g 2 ] by M, with the numbers 
on the edges indicating the number of elements of P in each partition. 
Using our predetermined priorities within G and M (from Section 1.4) 
and the precedence between G and M we have an immediate set of priorities 

Pri 1: [g 2 ] 

Pri 2: [g^] 



2 

Since (g^j has only a single member, (p^)^* we would like to further 
subdivide Priority 1. Here, an ambivalence develops. We may either 
take the priorities of M as our secondary order, resulting in: 

HI: Pri 1: 

Pri 2: [g 2 ]/m^ 

Pri 3: [g 2 ]/mg 
Pri 4: [g^J m^ 



consider 


the 


size of 


the 


H2 : Pri 


1: 




(7 


Pri 


2: 


[g2J/"»9 


(3 


Pri 


3: 


[g2J/"»4 


(1 


Pri 


4: 


[gy] 


(1 



subsets developed, i.e., 

members) 

members) 

member) 

member) 



It is at this point that we utilize the remaining structure on P, the 
set W. Since the elements of W (i.e., processes) are ordered sequences 



of which cut across G and M, we may find a solution. If we look at 
the functions contained in g2 we get a graph of the type shown in 
Figure III. If we delete the nodes (functions; which are isolated, we 
get an order graph such as that in Figure IV. 



Figure III 





Inspection of this graph provides us with a set of priorities which 
are intuitively attractive: 

Pri 1; [g 2 J/m^ 

Pri 2: [g2J/m^ 

Pri 3: [g 2 j/m^ 

Pri 4: [g^J/m^ 

In order to accomplish the [g 2 J/m^ function (fund accounting) , we have to 
generate the necessary data through the preceding functions which make up 
the pertinent processes within the financial system. This, then, is a 
natural sequence for implementation. 

2.3 Analysis of System Data: the Set F 

Given a complete and exhaustive definition of the system in terms of 
the department structure (S,I,0), we are in a position to generate the 
system data set F. This is essentially a tabulation process, in which 
duplicate use of a data group, i^, is recorded, generating for us the 
corresponding use index u^ e U and the program-value index v^ r, V according 
to its required use within the system. This is convenient, since we have 
previously made the linkage of program to function. We will use oniy the 
I-structure to generate the set F, i.e.: 




All nonterminal outputs, those which are source data sets to be reused by 
the system, will be picked up in the I-inventory. 

In generating the set F, we wish also to establish the object class 
associated with each data set through the R mapping. While the U- and 
V-generation was mechanical in nature, this assignment of object class is 
analytical and to a degree arbitrary. Table X presents a tabulation of 
data sets from our example structures (Section 1.1). It was necessary to 
extrapolate to obtain the U and V values, but the accuracy is adequate for 
our purposes. 
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Table X 



tlement 

(fij 


Description 


Cross/Ref. 
(Sect. 1.1) 


T 

Object 

Class 


User^ 

Demand 


•7 

Value' 

(V.) 




new emp bkgrd 


(i532 


^2 


4 


2.5 


^2 


•-.iipl action 


(if )2 


^2 


4 


3.0 




time/att-pers 


(if)3 


^2 


3 


2.0 


^4 


time-student 


(il)4 


^1 


3 


2.0 


^5 


emp history 


(ibs.Cii)^ 


^2 


10 


3.9 


"6 


emp pay hist 


Cib6.(i5)7 


^2 


7 


3.4 


^7 


stud pay hist 


( 11)7 


^1 


3 


2.0 


^8 


career dev ac 


(is)i 


^2 


4 


3.0 


^9 


place notice 


(is) 2 


^2 


4 


3.0 


^10 


emp p/p/r chg 


(i|)4 


^2 


5 


3.5 


^11 


emp spec pay 


ah, 


^2 


4 


3.0 


^12 


emp grant apl 


(is)8 


^2 


3 


2.5 


^13 


emp grant ale 


(is)9 


^3 


5 


3.5 


^14 


pay step stat 


(iS)jo 


^3 


5 


3 5 


^15 


state fund al 


ahn 


^3 


5 


3.5 


^16 


bene sched/de 


( 15)12 


^7 


5 


3.1 


^17 


pers commu:i 


(ishs 


^2 


3 


2.0 



6 . These numbers were generated from the requirements study based on 
the writer's participation in the collection cycle and utilize the data 
matrix in Section 4.8 of the study. They indicate the number of second- 
l“vel departments having the requirement. 



7. Using the departments reflected in user demand, the program 
priority related to each function was generated. A composite rank was 
then computed using the following formula: 



= Z 



D 



for 

1 



included in u. 

1 



V. 

1 



2.3.1 Partitioning of Data by Object Class 



The most natural subdivision of the set F is according to the related 
object classes. From the table we see the following partitioning: 

F/C : {[Cj], [C2J, [c^], [c^J> 

where [c^J= if^,f^,..} : pupil data 

[c^]- ^ 2 ’^3 ’^5 ’^6 ’^8 ’^9 ’^10 ’^11 ’^12 ’^17’ ’ ' ^ 

: personnel data 

[c^J= _ , . . } : financial data 

[c^]= if^^,..} : system data. 

Based on the data itself, we cannot legitimately say anything about the 
relative importance of object classes. Each class is necessary to the 
functioning of the school system; only within subsystems might such 
comparisons be made. We are in a position, however, to say something 
about the relative importance of data sets within an object class. To 
illustrate, we will use the U and \' mappings identified in Section I; 
the data sets from our two example departments; and the Table X estimates. 
Only [C 2 ] , personnel data, has a significant number of data elements 
included, but we will briefly look at all four classes. 

2.3.2 Importance Within Object Class 

Looking first at [c^] , pupil data, we see that successive application 
of partitioning by U and V produces no results: 

[Cj]/U = [Cj] = 

[Cj]/U/V = (cj] = 

No order of importance can be derived, since both f^ and f^ have a U-index 
of 3.0 and a V-index of 2.0. This is due to the lack of a significant 
portion of [c^] available. The relative importance in the table indicates, 
however, that these two data sets are probably not of maior importance 
within their class. 

Looking next at [C 2 ] , personnel data, we get a partitioning of the 
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data sets which produces a spectrum of importance; 



[c^J/U =i[u3j. 


^“ 4 ^’ 1^5^* [u^qJ] 


which by order of importance becomes : 


11: 


[Uiq] : 


^5 


12: 


tu, ] : 


^6 


15: 


^ • 


^10 


14: 


[**4 J : 




15: 


[Uj ] : 


^3 ’^12 ’^17 


If we further 


order [u^J 


using V, we get: 




[UjJ/V = 


Uv2 qL 1^2 jJ} 


where 


^’'2.0^ ^ 






'’'2.5^ = 


{fl2> 


If we further order [u^J 


on V, we get; 




lu^j/V = 


[V 2 5J,[v2 qJ 


where 


^^2.5^ • 


(fl> 




^'^3.0^ • 




Incorporating this substructure derived from V, we obtain the 


overall order 


of importance structure on [C 2 J , personnel data: 


11 : 


^5 


: employee history 


12 : 


^6 


: employee payroll history 


13: 


^10 


: personnel pay/pos/rec changes 


14: 




fj 2 : personnel actions notices 


15: 


"1 


: new employee data 


16: 


^12 


: personnel grant application 


17: 


^3’^17 


: time reporting/ correspondence 



following 
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This is not necessarily an implementation order; an implem.entation 
order will be obtained via the P-analysis. This ordering might be termed 
a machineability index, or something similar. In this view we might say: 

11 § 12 : permanent global records 

13 - 15 : transactional records; facilitating and 
informational 

I6 5 17 ; informational; not machined 



This order corroborates our intuitive sense of what is important among 
data sets . 

The item (data set) which intuitively seems out of place is f_ 

O 

(employee time reporting , since employees here are salaried, and time and 
attendance reporting is largely pro forma ; therefore we have little depart- 
mental interest beyond accounting. (As an aside, in the school system 
now this is a periodic report prepared on optical scan sheets.) 

Looking at [c^], financial data, we find the same difficulty we had 
with ; neither U nor V provide us with a partition. 



[C3] U= [03]= 

tc3]/V = [€3]= tfj3 



f f } 
* 14 * 15 ^ 



f f } 
* 14 *^ 15 ^ 



Here, for each data set, the U-index is 5 and the V-index 3.5. Although 
we again have too small a subset of to work with, we might expect 

that the class of financial data may well be more cohesive in terms of 
scope of interest than some of the other classes; we note that our three 
sets from [c^] had a generally higher index of importance than either [c^] 
or [c^] . 

Since only one data set from [c^J is shown, no comment can be made 
about the class of system data. 

Looking back to our ordering of personnel data, we need to comment 
briefly on disparities in size and content of data sets. At our level of 
definition we cannot say much about the content of the basic history 
records, which show the greatest importance. They are aggregate records, 
though, and if we exclude them from our list we should find that the 
transactional data provides at least a potential source of content infor- 
mation. An inspection of the transactional data sets in our example 
(13-15) corroborates this view. 
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SECTION III 

RECOMMENDATIONS AND COMPUTATIONAL APPROACH 

3.1 Plan of Action 

In current practice the requirements definition and analysis phase 
is probably most arbitrary among the stages of development. It is uni- 
formly recognized that formal analysis methods are highly dependent upon 
a complete and accurate definition of organizational functions and infor- 
mation requirements. That this kind of definition is not routinely obtain- 
able is also recognized. Langefors, for example, as most workers do, 
acknowledges the difficulty and proceeds with his development of analytic 
method. To highlight these difficulties, we have used a specific study 
(by no means a poor one) as source material in this paper. In this sec- 
tion, we will state the necessary content of such a report. 

3.2 Derivation of Primary System Sets and Attributes 

The great bulk of initial work is definitional in nature, comprising 
an inventory of a variety of structures. This can be seen to take place 
naturally in four stages: 

3.2.1 Derivation of D, G, M, and C 

Specification of these four sets provides the basic skeleton upon 
which the system is constructed. The set D is the most available and 
apparent. Its specification is a recording function in which notational 
devices are added for convenience. In many organizations some such coded 
notation is already available; this is especially true of the Federal 
government and military establishment. 

The set of object classes C is usually intuitively obtainable, with 
two exceptions. Ambiguities occasionally arise because of confusion 
between the object of discussion and the intended uses of information 
related to the object. A man’s pay and payroll funds must be distin- 
guished as descriptive of different object classes • i.e. personnel and 



8. See Langefors, Vol. II, Chapter 5, pp. 224-225. 



funds. In addition, there exists that set of information which can be 
said to pertain only to the system as a whole; for example, authorized 
manpower structure, funding authority, or external environment. These 



The definition of the sets G and M are the most difficult, since 
they require a high-level understanding of how the organization func- 
tions and should function. Senior management must therefore be respon- 
sible for defining these sets and establishing internal priorities. G 
should closely match the organizational objectives, while M should re- 
flect the operational functioning of the organization. 

3.2.2 Derivation of the (S, I, 0) -structure 

Using the D-structure as a checklist, it is necessary to define, by 
department, the elements of S, I, and 0 and to assign secondary attri- 
butes. For each p^, the program, activity, predecessor function and 
successor function must be identified; for each i^, the object class, 
programs, and activities must ^e identified. (Note that multiple assign- 
ments of program and activity to a data element is to be expected.) Sim- 
ilarly, to each o^, frequency, activities, and source/processed must be 
assigned. 

This is a major cataloging activity; and many of the deficiencies 
in requirements -analysis stem from the inability to carry this out ade- 
quately. For success, two factors are important. First, derivation 
must be conducted uniformly and at a feasible level of detail; greater 
detail is acceptable only if it requires no additional expenditure in 
collection or in processing. Second, precise uniformity in descriptive 
terminology is required so that identical or related elements may be 
recognized. 

3.2.3 Derivation of P, F, and R 

Once the work of the first two steps have been completed, use of the 
computer may be considered. We wish to generate P from the sets S^ and 
we wish to generate F from the input sets I^; if the work in step 2 above 
was successful, this is a straightforward machine task. Necessary con- 
solidations and sorting can be performed efficiently and quickly by 
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computer. As byproducts of computer processing we can identify ambig- 
uities, overlaps (a problem in manual consolidation), and omissions due 
to incomplete collection. In addition, computer generation of these sets 
allows us to use the machine again in subsequent steps. 

3.2.4 Generation of W and Indices on F and R 

Working from the predecessor and successor data on each element of P, 
the computer provides an efficient generator for what is a most tedious 
hand-operation--the generation of the set of processes W. This is a two- 
step process: the computer is used first to build strings representing 
candidate processes, and then to eliminate redundancies and to consolidate. 
(Some manual editing may be required to reflect discontinuities not visible 

from the data itself; these revisions should be incorporated into W before 
proceeding ) . 

Working from the attributed values of G and M, the computer can be 
used to calculate values of U and V for the elements of F, and values of 
X for the elements of R. Values for U and V are then added to the indi- 
vidual f^ records, and values for X to the corresponding r^ records. 

3.3 Procedures for Preliminary Analysis 

The developments in Sections 2.2 and 2.3 were experimental and demon- 
strative in nature, intended to suggest ways in which specific guidance to 
the implementation of an information system could be derived from the re- 
quirements data alone. The methods, however, need further use over more 
complete data before being formalized. In this section, then, we can 
merely suggest some general procedural approaches to the analysis of 
requirements data. 

3.3.1 Analysis of P 

In Section 2.2 we generated successive partitions (refinements) of 
P utilizing the induced attributes from G, M, and W, and producing an 
order on the subsets of functions so obtained. The procedure is: 

1. Partition P according to G and associate with 
the subsets generated the priority of the 
corresponding g^ 
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2 . 



Partition the elements of P/G according to M. 



3. Generate a precedence-order based on W for each 
element of P/G/M which is populous, i.e., which 
has more than one function within it. Use graph- 
theoretic methods to establish internal priorities. 

4. Use priorities of M or counting methods for breaking 
ties, or for further ordering where necessary. 

because of natural ties and ambiguities which may develop in the 
use of the W-structure, it may be advantageous to use an interactive 
method, in which the computer is used for clerical work and decisions 
are reserved for a human analyst. In any event, further experimentation 
is needed before more precise procedures can be stated. 

3.3.2 Analysis of F 



In Section 2.3 we considered each object class of data separately 
and used the computed index values to derive an order of importance among 
the data sets. Since -^ome of the critical decisions which were made are 
not readily formulated, we can only outline an approximate procedure: 

1. Sort the set F by object class and inspect each 
separately. 

2. Sort data sets within an object class and group 
them according to decreasing U-index. Initial 
assignment of importance is according to highest 
U-vaJue down- 

3. Where more than one data set occupies a U-value, 
the group is again ordered, this time by decreasing 
V-index. Since values of V are not necessarily 
discrete, it is necessary to impose arbitrary 
half-closed intervals prior to ordering. This, 
however, should cause no difficulty in processing. 

Overall importance-ranks are then reassigned 
incorporating this secondary order. 

4. The list is scanned top-down, removing those data 
sets which are permanent global files. This 
requires human judgment; however, once identified 
as such, this should become an element of identi- 
fication for the set. 
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5. The list is scanned bottom-up, removing all data 
sets with importance below a minimal UV-value. 

These items will be culled manually to insure 
that important data sets are not discarded, but 
generally these will be items which should not 
be considered in automation. 

The resultant list, possibly renumbered, provides guidance as to 
those data sets which must be incorporated into global files and indi- 
cates an order of importance for machine conversion. 

3.4 Summary 

In this paper an attempt has been made to contribute a level of 
formality to the requirements phase of system development and to the 
system- level aspects of the subsequent analysis. The intent has been 
to replace much of the intuitive approach to implementation with analytic 
method. Methods have been displayed which generate concrete recommen- 
dations to implementation planning. Finally, the use of the computer as 
an aid to analysis of requirements has been suggested. 
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APPLWDIX 
LITERATURE NOTES 



Beyond the restricted methods of traditional systems-andr-procedures 
workers, little has been written concerning formal recording and analysis 
methods for the requirements phase. In the business systems area, a few 
efforts of minor interest can be noted. Grindley (3,4,13) in Great 
Britain has suggested a recording method he calls "Systematics” which 
provides a structure that builds from object classes. Sussams (12), also 
of Great Britain, suggests a charting method which records "operators" 
(programs, specifications), "operands" (material, data), and "operatives" 
(men, machines) in an attempt to deal with the complexity of information 
system design. In this country, Stevens (11) suggests the use of a 
machined file for recording an even more detailed file on data elements 
than is suggested in this paper. 

Granting that the essential recording --in particular the recording 
of the (S, I, 0) structure -- has been accomplished, a number of sub- 
stantial works deal with the analysis of this material. Langefors (7), 
Salton (9,10), and Zunde (14) attack the analysis of information systems 
with linear algebra and graph theory. Graph-theoretic methods provide a 
powerful handle for the processing of structured data. Two works, those 
of Harary et. al . (5) and Busacker and Saaty (1), provide a broad entry 
into this material. Finally, two papers are included in the bibliography 
which are suggestive of ways in which graph-theoretic methods can provide 
excellent results. In Luckman's paper (8), the use of option graphs 
eliminates difficulties of interdependence in design decisions. Carroll 
(2) uses graph methods to analyse activity within an existing system. 
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